
Simplify Gramian normalization #302

Merged
ValerianRey merged 10 commits into main from change-gramian-normalization-to-frobenius-norm on Apr 10, 2025
Conversation

@PierreQuinton
Contributor

@PierreQuinton PierreQuinton commented Apr 6, 2025

  • Use the Frobenius norm instead of the spectral norm
  • Replace compute_normalized_gramian by normalize since the normalization is now composable with the gramian computation
  • Remove compute_regularized_normalized_gramian (in favor of composition of simpler functions)
  • Add test_scale_invariant
  • Make gramian_utils functions public to their package
  • Update the changelog entry related to the projection changes of UPGrad and DualProj
  • Add a changelog entry related to the normalization changes of UPGrad, DualProj and CAGrad

Now uses the Frobenius norm instead of the spectral norm.

The advantage is that it unifies the way we compute Gramians, and is therefore a safe step in the direction of autogram.
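To illustrate why the Frobenius-norm variant composes cleanly with the Gramian computation, here is a minimal NumPy sketch. The names `normalize` and `norm_eps` come from this PR, but the implementation below is an assumption, not the actual torchjd code; the key fact it relies on is that $\text{Tr}(G) = \|J\|_F^2$ when $G = J J^\top$.

```python
import numpy as np

def normalize(gramian: np.ndarray, norm_eps: float = 1e-8) -> np.ndarray:
    """Sketch of a trace-based (Frobenius) Gramian normalization.

    Since Tr(G) equals ||J||_F^2 for G = J J^T, dividing G by its trace is
    equivalent to normalizing the underlying Jacobian by its Frobenius norm.
    No eigendecomposition is needed, so this composes with any Gramian
    computation instead of requiring a fused compute_normalized_gramian.
    """
    return gramian / (np.trace(gramian) + norm_eps)

# Composition with a plain Gramian computation:
J = np.arange(6.0).reshape(2, 3)  # toy Jacobian (2 tasks, 3 parameters)
G = normalize(J @ J.T)            # normalized Gramian; Tr(G) is (almost) 1
```

Because the normalization is just a scalar division, it can be applied after any code path that produces a Gramian, which is what makes it a safe step toward autogram.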

TODO:

  • Fix conflict
  • Add changelog entry
  • Test this in the trajectories repo, with various values of the norm_eps param
  • Maybe change the default norm_eps of UPGrad and DualProj
  • Maybe run a jde training to double check that this works when doing deep learning
  • Remove the breaking-change label if it seems fine
  • Run test_scale_invariant on GPU
  • Improve the parametrization of test_scale_invariant
  • Try to reduce the atol of LUS property => Not possible, still due to UPGrad. This is comforting in the sense that this seems very similar to what we had before.

TODO after merging:

  • Add UPGrad to LibMTL since this normalization will be ok with earlier versions of torch

@PierreQuinton PierreQuinton added package: aggregation cc: refactor Conventional commit type for any refactoring, not user-facing, and not typing or perf improvements labels Apr 6, 2025
@codecov

codecov bot commented Apr 6, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

Files with missing lines                     Coverage Δ
src/torchjd/aggregation/_gramian_utils.py    100.00% <100.00%> (ø)
src/torchjd/aggregation/cagrad.py            100.00% <100.00%> (ø)
src/torchjd/aggregation/dualproj.py          100.00% <100.00%> (ø)
src/torchjd/aggregation/mgda.py              100.00% <100.00%> (ø)
src/torchjd/aggregation/upgrad.py            100.00% <100.00%> (ø)

@ValerianRey ValerianRey added the breaking-change This PR introduces a breaking change. label Apr 6, 2025
@ValerianRey
Contributor

ValerianRey commented Apr 6, 2025

How can we test that this is not degrading the performance of UPGrad? Do you think unit tests are sufficient for this? I think we should also use the trajectories project to verify empirically that the trajectories still look good (that will depend on the value of norm_eps, of course). Maybe we should even run one of the experiments of jde to verify that we can still obtain roughly similar performance.

I think we would also have to change the default value of norm_eps in UPGrad and DualProj.

Lastly, this clearly deserves a changelog entry. Arguably, this is not a breaking change, so we can remove that label if you think the behavior is not so different.

@ValerianRey ValerianRey moved this to In Progress in Aggregation Apr 6, 2025
@ValerianRey ValerianRey moved this to In Progress in Autogram Apr 6, 2025
@github-project-automation github-project-automation bot moved this from In Progress to Done in Aggregation Apr 6, 2025
@github-project-automation github-project-automation bot moved this from In Progress to Done in Autogram Apr 6, 2025
@PierreQuinton PierreQuinton reopened this Apr 6, 2025
@ValerianRey ValerianRey moved this from Done to In Progress in Autogram Apr 6, 2025
@ValerianRey ValerianRey moved this from Done to In Progress in Aggregation Apr 6, 2025
@PierreQuinton
Contributor Author

As a partial answer to your question, I added a test verifying that project_weights is scale-invariant with respect to the Gramian. So normalizing the Gramian does not change anything in the normal regime (values neither too high nor too low).
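The mechanism behind such a scale-invariance test can be sketched as follows. This is not the torchjd `project_weights` or `test_scale_invariant`; `solve_weights` below is a hypothetical stand-in, and the point is only that any deterministic function applied after a trace-based normalization inherits invariance to positive rescaling of the Gramian.

```python
import numpy as np

def normalize(gramian: np.ndarray, norm_eps: float = 1e-8) -> np.ndarray:
    # Hypothetical sketch of the trace-based (Frobenius) normalization.
    return gramian / (np.trace(gramian) + norm_eps)

def solve_weights(gramian: np.ndarray) -> np.ndarray:
    # Stand-in for project_weights: any deterministic function of the
    # normalized Gramian inherits its scale invariance.
    m = len(gramian)
    return np.linalg.solve(normalize(gramian) + 1e-6 * np.eye(m), np.ones(m))

rng = np.random.default_rng(0)
J = rng.standard_normal((5, 20))
G = J @ J.T
reference = solve_weights(G)
for c in (1e-3, 1.0, 1e3):  # scales within the "normal regime"
    assert np.allclose(solve_weights(c * G), reference, rtol=1e-4)
```

Outside the normal regime (when $c \cdot \text{Tr}(G)$ approaches norm_eps, or overflows), the invariance degrades, which is exactly why the choice of norm_eps matters.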

@PierreQuinton
Contributor Author

PierreQuinton commented Apr 8, 2025

Additionally, the ratio between the scaling factor we had and the one we have now is at most $\sqrt{m}$. This is because we used to scale by the largest singular value, which is the square root of the maximal eigenvalue of the Gramian. Now we scale by the square root of the trace of the Gramian. The trace of the Gramian is the sum of its eigenvalues, and we have $\sigma_{\max}(G) \leq \text{Tr}(G) \leq m\,\sigma_{\max}(G)$. Taking the square root, the new normalization factor lies between the largest singular value and $\sqrt{m}$ times that same singular value.
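This bound is easy to check numerically. A small sketch with a random Jacobian (plain NumPy, not torchjd code):

```python
import numpy as np

rng = np.random.default_rng(42)
m, n = 8, 50
J = rng.standard_normal((m, n))        # m tasks, n parameters
G = J @ J.T                            # Gramian

lam_max = np.linalg.eigvalsh(G).max()  # largest eigenvalue = sigma_max(J)^2
trace = np.trace(G)                    # sum of eigenvalues = ||J||_F^2

# sigma_max(G) <= Tr(G) <= m * sigma_max(G), so the old (spectral) and new
# (Frobenius) scaling factors differ by a ratio of at most sqrt(m).
assert lam_max <= trace <= m * lam_max
ratio = np.sqrt(trace) / np.sqrt(lam_max)
assert 1.0 <= ratio <= np.sqrt(m)
```

Both bounds are tight: the lower bound is approached by a rank-one Gramian, the upper bound by a Gramian proportional to the identity (all eigenvalues equal).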

@ValerianRey
Contributor

> Additionally, the scaling factor that we had versus what we have now has a ratio of at most $\sqrt{m}$. This is because we used to scale by the largest singular value, which is the square root of the maximal eigenvalue of the Gramian. Now we scale by the square root of the trace of the Gramian. The trace of the Gramian is the sum of its eigenvalues, and we have $\sigma_{\max}(G) \leq \text{Tr}(G) \leq m\,\sigma_{\max}(G)$. Taking the square root, we get that the new normalization is between the largest singular value and $\sqrt{m}$ times that same singular value.

Very good to know, thanks! So assuming m = 100 (roughly the maximum realistic value of m that users would have), the difference would be at most a factor of 10. I think this is fine, because from my past experience norm_eps was not so sensitive, and changing it by a factor of 10 should not affect the results much.

@ValerianRey
Contributor

Apart from my comments and todos, LGTM.

@PierreQuinton PierreQuinton removed the breaking-change This PR introduces a breaking change. label Apr 10, 2025
@PierreQuinton PierreQuinton force-pushed the change-gramian-normalization-to-frobenius-norm branch from ac8ddd3 to ee06659 on April 10, 2025 at 06:47
@PierreQuinton
Contributor Author

For me, we could have another PR looking at norm_eps and reg_eps. I do not know (yet) how to assess the correct values for those, and I need to think about it a lot more; the current values are fine for now. We will do JDE and trajectories at that point, so this is out of scope. As for the atol of the LUS property, that is related to a refactor of the property testers and is also out of scope for this PR.
So for me, this is now ready to merge.

@ValerianRey
Contributor

Waiting for Windows tests, and then we can merge!

@ValerianRey ValerianRey changed the title Change the normalization of Gramian Simplify Gramian normalization Apr 10, 2025
@ValerianRey ValerianRey merged commit f0d3eac into main Apr 10, 2025
15 checks passed
@github-project-automation github-project-automation bot moved this from In Progress to Done in Autogram Apr 10, 2025
@github-project-automation github-project-automation bot moved this from In Progress to Done in Aggregation Apr 10, 2025
@ValerianRey ValerianRey deleted the change-gramian-normalization-to-frobenius-norm branch April 10, 2025 11:56